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RAW SEQUENCE LISTING DATE : 01/16/2002 

PATENT APPLICATION: US/09/955 f 909 TIME: 11:05:49 

Input Set : N:\Crf3\RULE60\09955909.raw 
Output Set: N:\CRF3\01162002\I955909.xaw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: PELLETIER, Marc 

5 BARKER, William A. 

7 HAKES, David J. 

8 ZOPF, David A. 
10 (ii) TITLE OF INVENTION: METHODS FOR PRODUCING 

1± SIALYLOLIGOSACCHARIDES IN A DAIRY SOURCE 

13 (iii) NUMBER OF SEQUENCES: 10 

15 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: PENNIE & EDMONDS LLP 

17 (B) STREET: 1155 Avenue of the Americas 

18 (C) CITY: New York 

19 (D) STATE: NY 

20 ( E ) COUNTRY: USA 

21 (F) ZIP: 10036-2711 

23 (v) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Diskette 

25 (B) COMPUTER: IBM Compatible 

26 (C) OPERATING SYSTEM: DOS 

27 (D) SOFTWARE: FastSEQ Version 2.0 
29 (vi) CURRENT APPLICATION DATA: 

c __> 30 (A) APPLICATION NUMBER: US/0 9/955,909 

C--> 31 (B) FILING DATE: 18-Sep-2001 

32 (C) CLASSIFICATION: 

34 (vii) PRIOR APPLICATION DATA: 

35 (A) APPLICATION NUMBER: 08/911,393 

36 (B) FILING DATE: 

3 8 (viii) ATTORNEY/AGENT INFORMATION: 

39 (A) NAME: Coruzzi, Laura A 

40 (B) REGISTRATION NUMBER: 30,742 

41 (C) REFERENCE/DOCKET NUMBER: 7188-032-999 

4 3 (ix) TELECOMMUNICATION INFORMATION: 

44 (A) TELEPHONE: (212)7909090 

45 (B) TELEFAX: (212)8699741 

46 (C) TELEX: 66141 PENNIE 
49 (2) INFORMATION FOR SEQ ID NO: 1: 

51 (i) SEQUENCE CHARACTERISTICS: 

52 (A) LENGTH: 3183 base pairs 

53 (B) TYPE: nucleic acid 

54 (C) STRANDEDNESS : single 

55 (D) TOPOLOGY: linear 
57 (ii) MOLECULE TYPE: cDNA 

59 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

61 ATGGGGAAAA CAGTCGTTGG GGCCAGTAGG ATGTTCTGGC TAATGTTTTT CGTGCCGCTT 60 

62 CTTCTTGCGC TCTGCCCCAG CGAGCCCGCG CATGCCCTGG CACCCGGATC GAGCCGAGTT 120 

63 GAGCTGTTTA AGCGGCAAAG CTCGAAGGTG CCATTTGAAA AGGGCGGCAA AGTCACCGAG 180 
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64 CGGGTTGTCC ACTCGTTCCG CCTCCCCGCC CTTGTTAATG TGGACGGGGT GATGGTTGCC 240 

6 5 ATCGCGGACG CTCGCTACGA AACATCCAAT GACAACTCCC TCATTGATAC GGTGGCGAAG 300 

66 TACAGCGTGG ACGATGGGGA GACGTGGGAG ACCCAAATTG CCATCAAGAA CAGTCGTGCA 3 60 

67 TCGTCTGTTT CTCGTGTGGT GGATCCCACA GTGATTGTGA AGGGCAACAA GCTTTACGTC 4 20 

68 CTGGTTGGAA GCTACAACAG TTCGAGGAGC TACTGGACGT CGCATGGTGA TGCGAGAGAC 4 80 

69 TGGGATATTC TGCTTGCCGT TGGTGAGGTC ACGAAGTCCA CTGCGGGCGG CAAGATAACT 540 

7 0 GCGAGTATCA AATGGGGGAG CCCCGTGTCA CTGAAGGAAT TTTTCCCGGC GGAAATGGAA 600 
71 GGAATGCACA CAAATCAATT TCTTGGCGGT GCAGGTGTTG CCATTGTGGC GTCCAACGGG 660 
7 2 AATCTTGTGT ACCCTGTGCA GGTTACGAAC AAAAAGAAGC AAGTTTTTTC CAAGATCTTC 7 20 

73 TACTCGGAAG ACGAGGGCAA GACGTGGAAG TTTGGGGAGG GTAGGAGTGA TTTTGGCTGC 780 

74 TCTGAACCTG TGGCCCTTGA GTGGGAGGGG AAGCTCATCA TAAACACTCG AGTTGACTAT 840 

75 CGCCGCCGTC TGGTGTACGA GTCCAGTGAC ATGGGGAATT CGTGGGTGGA GGCTGTCGGC 900 
7 6 ACGCTCTCAC GTGTGTGGGG CCCCTCACCA AAATCGAACC AGCCCGGCAG TCAGAGCAGC 960 

77 TTCACTGCCG TGACCATCGA GGGAATGCGT GTTATGCTCT TCACACACCC GCTGAATTTT 102 0 

78 AAGGGAAGGT GGCTGCGCGA CCGACTGAAC CTCTGGCTGA CGGATAACCA GCGCATTTAT 1080 

79 AACGTTGGGC AAGTATCCAT TGGTGATGAA AATTCCGCCT ACAGCTCCGT CCTGTACAAG 1140 

80 GATGATAAGC TGTACTGTTT GCATGAGATC AACAGTAACG AGGTGTACAG CCTTGTTTTT 1200 

81 GCGCGCCTGG TTGGCGAGCT ACGGATCATT AAATCAGTGC TGCAGTCCTG GAAGAATTGG 1260 

82 GACAGCCACC TGTCCAGCAT TTGCACCCCT GCTGATCCAG CCGCTTCGTC GTCAGAGCGT 1320 

83 GGTTGTGGTC CCGCTGTCAC CACGGTTGGT CTTGTTGGCT TTTTGTCGCA CAGTGCCACC 1380 

84 AAAACCGAAT GGGAGGATGC GTACCGCTGC GTCAACGCAA GCACGGCAAA TGCGGAGAGG 144 0 

85 GTTCCGAACG GTTTGAAGTT TGCGGGGGTT GGCGGAGGGG CGCTTTGGCC GGTGAGCCAG 1500 

86 CAGGGGCAGA ATCAACGGTA TCACTTTGCA AACCACGCGT TCACGCTGGT GGCGTCGGTG 1560 

87 ACGATTCACG AGGTTCCGAG CGTCGCGAGT CCTTTGCTGG GTGCGAGCCT GGACTCTTCT 1620 

88 GGTGGCAAAA AACTCCTGGG GCTCTCGTAC GACGAGAAGC ACCAGTGGCA GCCAATATAC 1680 

89 GGATCAACGC CGGTGACGCC GACCGGATCG TGGGAGATGG GTAAGAGGTA CCACGTGGTT 1740 

90 CTTACGATGG CGAATAAAAT TGGTTCGGTG TACATTGATG GAGAACCTCT GGAGGGTTCA 1800 

91 GGGCAGACCG TTGTGCCAGA CGGGAGGACG CCTGACATCT CCCACTTCTA CGTTGGCGGG 1860 

92 TATGGAAGGA GTGATATGCC AACCATAAGC CACGTGACGG TGAATAATGT TCTTCTTTAC 1920 

93 AACCGTCAGC TGAATGCCGA GGAGATCAGG ACCTTGTTCT TGAGCCAGGA CCTGATTGGC 1980 

94 ACGGAAGCAC ACATGGGCAG CAGCAGCGGC AGCAGTGCCC ACAGTACGCC CTCAACTCCC 2040 

95 GCTGACAACG GTGCCCACAG TACGCCCTCA ACTCCCGCTG ACAGCAGTGC CCACAGTACG 2100 

96 CCCTCAACTC CCGCTGACAG CAGTGCCCAC AGTACGCCCT CAGCTCCCGG TGACAACGGT 2160 

97 GCCCACAGTA CGCCCTCGAC TCCCGGTGAC AGCAGTGCCC ACAGTACGCC CTCAACTCCC 2220 

98 GCTGACAACG GTGCCCACAG TACGCCCTCA GCTCCCGCTG ACAGCAATGC CCACAGTACG 2280 

99 CCCTCGACTC CCGCTGACAA CGGTGCCCAC AGTACGCCCT CAACTCCCGC TGACAACGGT 2 340 

100 GCCCACAGTA CGCCCTCGAC TCCCGGTGAC AACGGTGCCC ACAGTACGCC CTCGACTCCC 

101 GGTGACAGCA GTGCCCACAG TACGCCCTCA ACTCCCGCTG ACAACGGTGC CCACAGTACG 

102 CCCTCAGCTC CCGCTGACAG CAATGCCCAC AGTACGCCCT CGACTCCCGG TGACAACGGT 

103 GCCCACAGTA CGCCCTCAGC TCCCGCTGAC AGCAATGCCC ACAGTACGCC CTCGACTCCC 2580 

104 GGTGACAGCA GTGCCCACAG TACGCCCTCA GCTCCCGGTG ACAACGGTGC CCACAGTACG 2640 

105 CCCTCAGCTC CCGCTGACAG CAGTGCCCAC AGTACGCCCT CAGCTCCCGG TGACAACGGT 2700 

106 GCCCACAGTA CGCCCTCAGC TCCCGCTGAC AACGGTGCCC ACAGTACGCC CTCAGCTCCC 2760 

107 GGTGACAGCA ATGCCCACAG TACGCCCTCG ACTCCCGCTG ACAGCAGTGC CCACAGTACG 

108 CCCTCAACTC CCGCTGACAG CAGTGCCCAC AGTACGCCCT CAGCTCCCGG TGACAACGGT 

109 GCCCACAGTA CGCCCTCAGC TCCCGCTGAC AGCAGTGCCC ACAGTACGCC CTCAATTCCC 

110 GGTGACAGCA GTGCCCACAG TACGCCCTCA GCTCCCGCTG ACAGCAGTGC CCACAGTACG 

111 CCCTCAGCTC CCGGTGACAA CGGTGCCCAC AGTACGCCCT CGACTCCCGC TGACAACGGC 3060 

112 GCTAATGGTA CGGTTTTGAT TTTGCACGAT GGCGCTGCAT TTTCGGCCTT TTCGGGCGGA 



2400 
2460 
2520 



2820 
2880 
2940 
3000 
3060 
3120 
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113 GGGCTTCTTT TGTGTGCGGG TGCTTTGCTG CTGCACGTGT TCGTTATGGC AGTTTTTTTC 3180 

3183 

114 TGA 

116 (2) INFORMATION FOR SEQ ID NO : 2: 

118 (i) SEQUENCE CHARACTERISTICS: 

119 (A) LENGTH: 1060 amino acids 

120 (B) TYPE: amino acid 

121 (C) STRANDEDNESS: single 

122 (D) TOPOLOGY: linear 
124 (ii) MOLECULE TYPE: protein 

126 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

128 Met Gly Lys Thr Val Val Gly Ala Ser Arg Met Phe Trp Leu Met Phe 

129 1 5 10 15 
Phe Val Pro Leu Leu Leu Ala Leu Cys Pro Ser Glu Pro Ala His Ala 

20 25 30 

132 Leu Ala Pro Gly Ser Ser Arg Val Glu Leu Phe Lys Arg Gin Ser Ser 

133 35 40 45 

134 Lys Val Pro Phe Glu Lys Gly Gly Lys Val Thr Glu Arg Val Val His 

135 50 55 60 

136 Ser Phe Arg Leu Pro Ala Leu Val Asn Val Asp Gly Val Met Val Ala 

137 65 70 75 80 

138 He Ala Asp Ala Arg Tyr Glu Thr Ser Asn Asp Asn Ser Leu He Asp 

85 90 95 

Thr Val Ala Lys Tyr Ser Val Asp Asp Gly Glu Thr Trp Glu Thr Gin 

100 105 HO 

142 He Ala He Lys Asn Ser Arg Ala Ser Ser Val Ser Arg Val Val Asp 

143 115 120 125 

144 Pro Thr Val He Val Lys Gly Asn Lys Leu Tyr Val Leu Val Gly Ser 

145 130 135 140 



130 
131 



139 
140 
141 



14 6 Tyr Asn Ser Ser Arg Ser Tyr Trp Thr Ser His Gly Asp Ala Arg Asp 

145 150 155 160 

Trp Asp He Leu Leu Ala Val Gly Glu Val Thr Lys Ser Thr Ala Gly 

165 170 175 

150 Gly Lys He Thr Ala Ser He Lys Trp Gly Ser Pro Val Ser Leu Lys 



147 145 
148 
149 



151 



159 
160 
161 



180 185 190 



152 Glu Phe Phe Pro Ala Glu Met Glu Gly Met His Thr Asn Gin Phe Leu 

153 195 200 205 

154 Gly Gly Ala Gly Val Ala He Val Ala Ser Asn Gly Asn Leu Val Tyr 

155 210 215 220 

156 Pro Val Gin Val Thr Asn Lys Lys Lys Gin Val Phe Ser Lys He Phe 

157 225 230 235 240 

158 Tyr Ser Glu Asp Glu Gly Lys Thr Trp Lys Phe Gly Glu Gly Arg Ser 

245 250 255 

Asp Phe Gly Cys Ser Glu Pro Val Ala Leu Glu Trp Glu Gly Lys Leu 

260 265 270 

162 He He Asn Thr Arg Val Asp Tyr Arg Arg Arg Leu Val Tyr Glu Ser 

163 275 280 285 

164 Ser Asp Met Gly Asn Ser Trp Val Glu Ala Val Gly Thr Leu Ser Arg 

165 290 295 300 

166 Val Trp Gly Pro Ser Pro Lys Ser Asn Gin Pro Gly Ser Gin Ser Ser 
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167 305 310 315 320 

168 Phe Thr Ala Val Thr lie Glu Gly Met Arg Val Met Leu Phe Thr His 

325 330 
"o Pro Leu Asn Phe Lys Gly Arg Trp Leu Arg Asp Arg Leu Asn Leu Trp 

340 

^72 Leu Thr Asp Asn Gin Arg He Tyr Asn Val Gly Gin Val Ser lie Gly 

173 355 360 365 

174 Asp Glu Asn Ser Ala Tyr Ser Ser Val Leu Tyr Lys Asp Asp Lys Leu 

^76 Tyr Cys Leu His Glu lie lln Ser Asn Glu Val Tyr Ser Leu Val Phe 

178 III Arg Leu Val Gly III Leu Arg He lie Lys Ser Val Leu Gin Ser 

179 405 410 !; » 

180 Trp Lys Asn Trp Asp Ser His Leu Ser Ser lie Cys Thr Pro Ala Asp 

III Pro Ala Ala Ser Ser Ser Glu Arg Gly Cys Gly Pro Ala Val Thr Thr 

435 440 445 

Val Gly Leu Val Gly Phe Leu Ser His Ser Ala Thr Lys Thr Glu Trp 

III Glu Asp Ala Tyr Arg Cys Val Asn Ala Ser Thr Ala Asn Ala Glu Arg 

470 475 
188 Val Pro Asn Gly Leu Lys Phe Ala Gly Val Gly Gly Gly Ala Leu Trp 

485 ^ ^0 

190 Pro val Ser Gin Gin Gly Gin Asn Gin Arg Tyr His Phe Ala Asn His 

500 ^05 510 

192 Ala Phe Thr Leu Val Ala Ser Val Thr lie His Glu Val Pro Ser Val 

515 520 52 

Ala Ser Pro Leu Leu Gly Ala Ser Leu Asp Ser Ser Gly Gly Lys Lys 

530 535 
196 Leu Leu Gly Leu Ser Tyr Asp Glu Lys His Gin Trp Gin Pro lie Tyr 

550 555 jdu 

III III Ser Thr Pro Val Thr Pro Thr Gly Ser Trp Glu Met Gly Lys Arg 

199 565 570 575 

200 Tyr His Val Val Leu _. 

201 580 585 590 



Tvr His Val Val Leu Thr Met Ala Asn Lys He Gly Ser Val Tyr lie 

580 585 590 

Asp Gly Glu Pro Leu Glu Gly Ser Gly Gin Thr Val Val Pro Asp Gly 



202 

203 595 
204 



600 605 



Arg Thr Pro Asp lie Ser His Phe Tyr Val Gly Gly Tyr Gly Arg Ser 

206 Asp Met Pro Thr He Ser His Val Thr Val Asn Asn Val Leu Leu Tyr 

630 635 o**u 

207 Asn Arg Gin Leu Asn Ala Glu Glu He Arg Thr Leu Phe Leu Ser Gin 

645 650 655 

Asp Leu lie Gly Thr Glu Ala His Met Gly Ser Ser Ser Gly Ser Ser 



208 
209 
210 

211 660 



665 670 



212 Ala His Ser Thr Pro Ser Thr Pro Ala Asp Asn Gly Ala His Ser Thr 

675 680 685 

2^4 Pro Ser Thr Pro Ala Asp Ser Ser Ala His Ser Thr Pro Ser Thr Pro 
215 690 695 700 
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216 

217 

218 

219 

220 

221 

222 

223 

224 

225 

226 

227 

228 

229 

230 

231 

232 

233 

234 

235 

236 

237 

238 

239 

240 

241 

242 

243 

244 

245 

246 

247 

248 

249 

250 

251 

252 

253 

254 

255 

256 

257 

258 

259 

260 

261 

263 

265 

266 



Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala Pro Gly 

705 710 715 

Ala His Ser Thr Pro Ser Thr Pro Gly Asp Ser Ser Ala 

725 730 
Pro Ser Thr Pro Ala Asp Asn Gly Ala His Ser Thr Pro 

Ala Asp Ser III Ala His Ser Thr Pro Ser Thr Pro Ala 

755 760 
Ala His Ser Thr Pro Ser Thr Pro Ala Asp Asn Gly Ala 

770 775 780 

Pro Ser Thr Pro Gly Asp Asn Gly Ala His Ser Thr Pro 

785 790 795 

Gly Asp Ser Ser Ala His Ser Thr Pro Ser Thr Pro Ala 

805 810 
Ala His Ser Thr Pro Ser Ala Pro Ala Asp Ser Asn Ala 

82 5 

Pro Ser Thr Pro Gly Asp Asn Gly Ala His Ser Thr Pro 
835 840 845 

Ala Asp Ser Asn Ala His Ser Thr Pro Ser Thr Pro Ala 

850 855 860 

Ala His Ser Thr Pro Ser Ala Pro Gly Asp Asn Gly Ala 
865 870 875 

Pro Ser Ala Pro Ala Asp Ser Ser Ala His Ser Thr Pro 

885 890 
Gly Asp Asn Gly Ala His Ser Thr Pro Ser Ala Pro Ala 

900 905 
Ala His Ser Thr Pro Ser Ala Pro Gly Asp Ser Asn Ala 
915 920 925 

Pro Ser Thr Pro Ala Asp Ser Ser Ala His Ser Thr Pro 

930 9 35 940 

Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala Pro Gly 

945 950 955 

Ala His Ser Thr Pro Ser Ala Pro Ala Asp Ser Ser Ala 

965 970 
Pro Ser He Pro Gly Asp Ser Ser Ala His Ser Thr Pro 

980 9 85 
Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala Pro Gly 
995 1000 1° 05 

Ala His Ser Thr Pro Ser Thr Pro Ala Asp Asn Gly Ala 

1010 io 15 1020 

Val Leu He Leu His Asp Gly Ala Ala Phe Ser Ala Phe 

1025 1° 30 1035 

Glv Leu Leu Leu Cys Ala Gly Ala Leu Leu Leu His Val 

1045 1050 

Ala Val Phe Phe 

1060 

(2) INFORMATION FOR SEQ ID NO : 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1929 base pairs 



Asp Asn Gly 
720 

His Ser Thr 

735 
Ser Ala Pro 
750 

Asp Asn Gly 



His Ser Thr 

Ser Thr Pro 
800 

Asp Asn Gly 

815 
His Ser Thr 
830 

Ser Ala Pro 

Asp Ser Ser 

His Ser Thr 
880 

Ser Ala Pro 

895 
Asp Asn Gly 
910 

His Ser Thr 

Ser Thr Pro 

Asp Asn Gly 
960 

His Ser Thr 

975 
Ser Ala Pro 
990 

Asp Asn Gly 

Asn Gly Thr 

Ser Gly Gly 
1040 
Phe Val Met 
1055 
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VERIFICATION SUMMARY DATE: Jl/16/2002 

PATENT APPLICATION: US/09/955,909 TIME: 11:05:50 

input Set : N:\Crf3\RULE60\09955909.raw 
Output Set: N:\CRF3\01162002\I955909.raw 

L:3 0 M:220 C: Keyword misspelled or invalid format, [(A) J""^; .? TOBBR: 1 
L:31 M-.220 C: Keyword misspelled or invalid format, [(B) FILING DATE . 
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